CDS

Accession Number TCMCG064C30473
gbkey CDS
Protein Id XP_011097518.1
Location join(3759386..3759474,3760620..3760683,3761077..3761198,3761840..3761893,3761988..3762107,3762195..3762314,3762399..3762534,3762621..3762791,3762905..3763051,3763125..3763322,3763421..3763562,3763656..3763812,3763890..3764006,3764106..3764229,3764420..3764668)
Gene LOC105176420
GeneID 105176420
Organism Sesamum indicum

Protein

Length 669aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_011099216.2
Definition uncharacterized protein LOC105176420 [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category S
Description Rhamnogalacturonate lyase family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K18195        [VIEW IN KEGG]
EC 4.2.2.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAGATGGGTGGAGGGCTGTCAATACTAGGATGTCTTGTGATCATATTCTTGCTAGTCGATTGCAGCACCAAGATCCATGCACACAGGAGAAATCCAGAGAGTAATCAAGGGTCATCACCTCCAGTCAAGCTCCAAGTTTTGGACAACCATGTGGAGATAGACAATGGCATCGTTAAACTTACGTTGTTAAAGCCGAGCGGCATGATCACAGGCATCGGCTACAAAGGAATCAATAACGTACTTGAGTATCGCTCTAAAGAAACACAACGAGGCTATTGGGACATTGTGTGGAGCAGGCCAGAGATAGGCAAAAGTTTCTTCGACACGCTCCAATGTACAAGCTTTAAGGTTGTAGCAAAGACTAAGGATCAGGTTGAAGTTTCTTTCACAAAGACATGGAATGTCTCCCTTGGCAACAATGTCGTCCCCCTCAACATCGATAAAAGGTTCATAGTCCTGCGTGGAGTTTCAGGTTTTTACTCATATGCAATTTTCGAGCACTTGAAAGGATGGCCTGATCTAAACGTCGATGAAGCTAGAATCGCTTTCAAGCTTCACCAGGACATGTTCCATTACATGGCTATATCGGATGACAAGCAAAGGATCATGCCATCGGACCGCGACAGGACAGCAGGCCATGTTCTTGATTACAGAGAAGCTGTTTTGTTAACAAATCCTGGAAATCCAAACCTTAAAGGAGAGGTCGATGACAAATACCAGTACTCGTGCGAAAACAAGGATAATCGTGTCCACGGGTGGATTAGCTCTAATCCACGTGTCGGATTTTGGGTGATAACTCCCAGTGATGAGTTCCGGGCCGGTGGACCTGTGAAACCGGACCTCACATCACATGCTGGCCCGACTTCCTTAGCTATATTCTTTAGTGGGCATTATGCTGGTCCGAGTTTTGGAATTCGATTGCGCAATGGAGAGGCCTGGAAGAAGGTCTTTGGTCCTGTTTTCATCTACCTTAACTCGGGTTCGACAAATAATGCACTCTGGGAAGATGCTAAAAAACAGATGTCTGAAGAAACCAAGAAATGGCCGTATGATTTCCCGATGTCAGTGGATTTTCCTCATGCCAGTCAACGAGGTACCATCAGTGGTCGATTACTAGTTCGTGACAGGTACATCAGTAGAGAACTTATGCCGGCAAAATCAGCTTACGTCGGATTGGCTCCGCCTGGAAATGCTGGATCCTGGCAAGAAGAAACTAAGGGTTATCAATTTTGGACACGAACTGACGATAAGGGCTACTTCACGATACAAGGTGTCCGAGCAGACACTTACAACTTATACGCATCGATTCCAGGGATCATCGGAGACTACAAACACGACATAGATGTTATAATCAGACCTGGAAGCAAAGTTGGAATAGGTGATCTCGTGTACGATCCTCCAAGACACGGTCCAACGCTCTGGGAAATCGGGATCCCTGATCGTTCTGCAGCCGAATTCTACATACCTGATCCGGCCCCAGGCCTTGTAAACAAGTTGTTCATCAACCATAAAGACAAGTTTAGGCAATACGGGCTATGGGATAGATACACAGATTTGTATCCGACCGAAGATTTAGTTTATACAGTTGGCATCAGTGATTATCGTAAAGACTGGTTCTTTGCTCATGTTAACAGAAACATAGACAACAACTATACACCAACCACATGGCGGATTTTGTTCGATGTAAGAAATGTCAGCAGGAGAGGAACTTATACACTCAGGCTGGCTTTGGCTTCTGCTAACTTCGCTGAAATACAAGTATGGATCAACAACCCTTACGGTCATCGCCCTCTTTTCACAACACAACGGATAGGGAGGGACAACGCGATTGCAAGACATGGCATTCACGGGCAATACCGATTATATAGTATTAATTTATCAGGATTTCAACTAGTAAACGGGAGAAACACGATATATCTCAAGCAAGCAAGAGGGTCGAGCCCTTTCGCAGGAGTGATGTATGACTACATTCGGTTGGAAGGGCCTCCTCAGGCCTACTATAATTAG
Protein:  
MEMGGGLSILGCLVIIFLLVDCSTKIHAHRRNPESNQGSSPPVKLQVLDNHVEIDNGIVKLTLLKPSGMITGIGYKGINNVLEYRSKETQRGYWDIVWSRPEIGKSFFDTLQCTSFKVVAKTKDQVEVSFTKTWNVSLGNNVVPLNIDKRFIVLRGVSGFYSYAIFEHLKGWPDLNVDEARIAFKLHQDMFHYMAISDDKQRIMPSDRDRTAGHVLDYREAVLLTNPGNPNLKGEVDDKYQYSCENKDNRVHGWISSNPRVGFWVITPSDEFRAGGPVKPDLTSHAGPTSLAIFFSGHYAGPSFGIRLRNGEAWKKVFGPVFIYLNSGSTNNALWEDAKKQMSEETKKWPYDFPMSVDFPHASQRGTISGRLLVRDRYISRELMPAKSAYVGLAPPGNAGSWQEETKGYQFWTRTDDKGYFTIQGVRADTYNLYASIPGIIGDYKHDIDVIIRPGSKVGIGDLVYDPPRHGPTLWEIGIPDRSAAEFYIPDPAPGLVNKLFINHKDKFRQYGLWDRYTDLYPTEDLVYTVGISDYRKDWFFAHVNRNIDNNYTPTTWRILFDVRNVSRRGTYTLRLALASANFAEIQVWINNPYGHRPLFTTQRIGRDNAIARHGIHGQYRLYSINLSGFQLVNGRNTIYLKQARGSSPFAGVMYDYIRLEGPPQAYYN